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Abstract. Social media has been contributing many research areas such 
as data mining, recommender systems, time series analysis, etc. How¬ 
ever, there are not many successful applications regarding social media 
in government agencies. In fact, lots of governments have social media 
accounts such as twitter and facebook. More and more customers are 
likely to communicate with governments on social media, causing mas¬ 
sive external social media data for governments. This external data would 
be beneficial for analysing behaviours and real needs of the customers. 
Besides this, most governments also have a call centre to help customers 
solve their problems. It is not difficult to imagine that the enquiries on 
external social media and internal call centre may have some coupling 
relationships. The couplings could be helpful for studying customers’ in¬ 
tent and allocating government’s limited resources for better service. In 
this paper, we mainly focus on analysing the coupling relations between 
internal call centre and external public media using time series analysis 
methods for Australia Department of Immigration and Border Protec¬ 
tion. The discovered couplings demonstrate that call centre and public 
media indeed have correlations, which are significant for understanding 
customers’ behaviours. 


1 Introduction 

Social media has been attracting more and more attention from many research 
areas such as data mining, text mining, recommender systems [17] and etc. The 
related topics in social media mainly include: event discovery, group or commu¬ 
nity discovery, social network analysis, personalized recommendation, sentiment 
analysis and so on. However, regarding the applications of social media analysis, 
there is little work in government agencies, actually even the mature techniques 
have not been well applied in governments such as the Australia Department of 
Immigration and Border Protection (DIBP). 

Much helpful information such as immigration or visa-related policies has 
been posted by DIBP’s accounts on twitter. The people who are of interest to 
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Australia may pay attention to DIBP’s tweets. For example, some users may 
reply to the tweets by asking further questions or just expressing a positive or 
negative opinion, others may repost the tweets to propagate the information to 
their friends. We believe that the study and analysis of the social media will 
potentially improve and better understand the business of the governments. In 
addition to communicate with DIBP on social media websites, many people are 
likely to directly ring DIBP’s call centre to ask their questions or check the 
progress of visa applications. This internal call centre data is very helpful for 
discovering people’s interests on DIBP and allocating the limited resources for 
better public service. The internal call centre data may also have some couplings 
with the external social media data. For example, during one period, the call 
centre may receive many queries about visa applications. If we check the so¬ 
cial media websites, maybe there are many visa-related queries as well. When 
Australia commences a new policy, probably there will be many discussions re¬ 
garding this policy on social media websites, the calls regarding the policy will 
be also increasing in the following days or weeks. This motivates us to meditate 
the following questions: (1) Whether the internal call centre and external social 
media have correlations? (2) If yes, is there some time lag between the two plat¬ 
forms? (3) How the significant events related to DIBP influence the internal call 
centre and external social media and their correlations? Based on the internal 
call centre data, DIBP has many predictive models such as risk evaluation, intent 
analysis. We believe that the consideration of external data and the correlations 
with internal call centre data would contribute to improving the performance of 
predictive or monitoring models. 

To analyse the couplings between call centre and public media, we first give 
a toy example in table to illustrate how the data look like. The essential 
attributes of every phone call includes “ID”, “Date”, “Duration”, “Disposition 
Code” and “Disposition Code Text”. By these attributes, the exact time and the 
duration of the phone call are recorded, after the interaction, DIBP’s call centre 
agent will categorize this enquiry into different topics by “Disposition Code” and 
the detail attribute “Disposition Code Text”. Similarly on social media, every 
tweet also has many attributes such as “tweetID”, “tweetText”, “postTime”, 
and “userName”. The attributes of tweets does not contain the disposition code 
for categorization, but we can use text mining techniques to determine the topic 
information based on “tweetText” attribute. 


Table 1. A Toy Example for Call Centre 


ID 

Date 

Duration 

Disposition Code 

Disposition Code Text 

IDl 

6/12/2013 

Imin 

457 visa 

457 visa application progress 

ID2 

6/12/2013 

10 min 

student visa 

574 visa 

ID3 

6/12/2013 

3 min 

457 visa 

457 visa application progress 

ID4 

7/12/2013 

15 min 

600 visit 

600 visit visa 

ID5 

7/12/2013 

Imin 

600 visit 

600 electronic visa 

ID6 

7/12/2013 

5min 

skilled migration 

skilled selection 
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Because both of the data sets contain the “time” information, we could trans¬ 
form the data sets into time series, then apply time series analysis methods to 
study the coupling relations. The contributions of the paper are concluded as 
follow: 

— We automatically extracted DIBP’s external twitter data using R program¬ 
ming language; 

— We investigated the characteristics of DIBP’s internal call centre data by 
time series analysis; 

— We analysed the correlations between internal call centre and social media 
by a symbolic representation time series analysis method; 

— We conducted experiments to verify the couplings between the internal and 
external data. 

The rest of the paper is organized as follows. Section 2 presents the related 
work. In Section 3, we propose and detail the coupling analysis framework for call 
centre and public media. Experimental analysis and some discovered couplings 
are presented in Section 4 followed by the conclusion. 


2 Related Work 

With the development of social network, more and more researchers have been 
trying to incorporate social media into relevant research areas such as sentiment 
analysis [14], time series analysis [T9|, recommender svstems[T3]. text mining [TS] 
etc. Regarding time series analysis, the main research topics include: time series 
index and search|4][3][2], clustering [9][T^[IT] . classification [T5][6][5] . summariza¬ 
tion, anomaly detection and so on. For example, through some similarity or 
dissimilarity measures p!] [TB]. we can query relevant time series, or partition sim¬ 
ilar time series into a group by clustering or classification methods. However, 
there are not many successful applications to study the couplings between inter¬ 
nal data and external social media data. This strongly motivates us to analyse 
the couplings using time series analysis techniques. 

Time series changing trends are often affected by some seasonal patterns such 
as cultural festivals like Christmas or Chinese New Year, to better understand 
the time series, it is often helpful to remove the seasonal patterns which can con¬ 
ceal the true underlying movements in the series. The main methods for time 
series decomposition [7] include: moving average, additive decomposition, mul¬ 
tiplicative decomposition, ARIMA decomposition. In this paper, the simple and 
mature additive decomposition method implemented in R is applied to remove 
the seasonal patterns. 

3 Coupling Analysis Framework 

The framework for correlation analysis in this paper is shown in Fig. The 
main steps include: 
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— Extracting external twitter data; 

— Integrating different call centre data into specific topics using merging rules; 

— Transforming external and internal data into time series; 

— Applying time series decomposition on call centre data to remove seasonal 
patterns; 

— Computing correlations similarities and lagged time between call centre and 
public media; 

— Using symbolic representation based weekly aggregation on time series for 
analysing the correlations ; 

— Extracting interesting changing patterns based on computed similarities and 
symbolic changing trends. 



Fig. 1. Goupling Analysis for Gall Gentre and Twitter 


In this section, we introduce the main techniques in this framework. 


3.1 Call Centre Data Integration 

DIBP has a mainland call centre in Canberra and international call centres in 
London and Ottawa, the categorization labels such as “Disposition Code” and 
“Disposition Code Text” are handled by different staff from different countries, 
having different file formats. Eig. is an example for Canberra’s call centre 
data. It is reasonable to integrate different call centre data together. Therefore, 
we reorganize the categorization levels and group all the enquiry records into 
six general categories: citizen, work, study, visit, permanent and other by the 
following merging rules: 

— Rule 1: I strident 156015701571157215731574157515761 temp ^rad ^ study 

— Rule 2: 600|676|e600|e651|e676|eta|transtt|6i45mess visitor\eVisitor\visitor\ 
sponsor visitor\medical treatment\carer visit 
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Fig. 2. Example of Call Centre in Canberra 


Rule 3: 400|417|456|457|462|e400|e417|e457|e462|temp long\temp short\whm 


work 


— Rule 4: gsm\family migration\professional migration\partner migration\ 

rrv\adoption\adult migrant Englishprogram\business skills\cers\employer 
sponsored\employees\employers\NZ Family Relationship\parent\refugee\ 
remaining relative\skilled migration\skill seleet\f amily and living permanent 

— Rule 5: eitizenship\eonferral\deelaratory visa\deseent\renuneiation\ 
resumption eitizen 

Applying these rules, we can integrate different call centre data into a com¬ 
bined internal data set for deeper analysis. 

3.2 Time Series Analysis 

Time Series Transformation The two attributes “Disposition Code” and 
“Disposition Code Text” are significant for investigating the correlations for 
specific topics, because the attributes are supervised and evaluated by the do¬ 
main experts. If we transform the internal call centre data by time series using 
frequency and percentage strategy as shown in Eqn. and the interaction 
records in Table [T] could be transformed to Table O 


frequeney{eiRi) = num{eiRi) 
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where, num{ci^ti) is the volume number of the queries for specific topic q on 
time ti, and num{ti) is the number of all the enquiries on ti. 


Table 2. Time Series for Gall Gentre 


Date 

Topic 

Frequency 

Percentage 

6/12/2013 

457 visa 

2 

2/3 

6/12/2013 

student visa 

1 

1/3 

7/12/2013 

600 visit 

2 

2/3 

8/12/2013 

skilled migration 

1 

1/3 


Time series analysis 
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Time 


Time series analysis 
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Time 


Category 

citizen 

- permanent 


Fig. 3. Frequency and Percentage Strategy for Time Series Analysis 


Through the above two strategies, the call centre data can be transformed 
into topical time series. Fig. shows the transformed topical time series respec¬ 
tively for frequency and percentage strategies. To analyse the couplings between 
call centre and twitter, we also need to convert the public media data into the 
similar topical time series. Keyword searching is the basic method for this trans¬ 
formation, the main steps include: 

— Based on the extracted twitter data, we first apply keyword searching to get 
specific topical data. For instance, the data regarding “student visa” topic 
could be collected by combining searching results of “student visa”, “560”, 
“570”, “57r’, “572”, “573”, “574”, “575”, and “576”. 

— For specific topic, we respectively apply the above frequency and percentage 
strategies to transform the searched data into topical time series. 
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Time Series Decomposition From the combined call centre data, basically 
we can get five groups of time series regarding different topics. From the call 
centre data of DIBP, there are no enquires during weekends and public holidays 
because call centre is not open during that time, causing seasonal sharp troughs 
for the time series changing trends. This seasonal effect is a systematic and calen¬ 
dar related effect caused by public holidays. To better understand the changing 
patterns of the time series, the observed data needs to be seasonally adjusted as 
seasonal effects can conceal both the true underlying movement in the series, as 
well as certain non-seasonal characteristics which may be of interest to analysts 
and business. Thus, the time series decomposition method is applied to remove 
the seasonal effect on time series. 

It is well known that time series can be decomposed into three components: 
the trend (long term direction), the seasonal and the irregular (unsystematic, 
short term fluctuations). The trend is defined as the dong term’ movement in a 
time series without calendar related and irregular effects, and is a reflection of the 
underlying level. The seasonal component consists of effects that are reasonably 
stable with respect to timing, direction and magnitude. It arises from system¬ 
atic, calendar related influences such as: natural weather conditions; business or 
administrative procedures; social and cultural behaviour such as Christmas and 
Chines New Year. The irregular component is what remains after the seasonal 
and trend components of a time series have been estimated and removed. It re¬ 
sults from short term fluctuations in the series which are neither systematic nor 
predictable. 


Decomposition of additive time series 





Fig. 4. Example of Time Series Decomposition 
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Decomposed Time series analysis 



DecOIDecIS Jan OUan 15 Feb O'Feb l&larOI 

Time 


- pern'ianent 


Fig. 5. Frequency and Percentage Strategy for Decomposed Time Series Analysis 


To decompose the call centre time series data, we apply the “decompose” 
function by R language using moving averages with addictive model. Specifically, 
this function first determines the trend component using a moving average with 
equal weighted symmetric window, and removes it from the time series. Then, 
the seasonal figure is computed by averaging method over all periods for each 
time unit. The seasonal figure is then centered. Finally, the irregular component 
is determined by removing trend and seasonal figure from the original time series. 
Fig. 1^ shows an example of the time series decomposition for “visit” category. 
Fig. shows the decomposed time series based on frequency and percentage 
strategies. 


3.3 Correlation Analysis 

Cross-Correlation Analysis In time series analysis, as applied in statistics 
and signal processing, the cross correlation between two time series describes the 
normalized cross covariance function. The cross correlation of two time series can 
be estimated by averaging the product of samples measured from one process 
and samples measured from the other time series. The samples included in the 
average can be an arbitrary subset of all the samples in the time series. For a 
large number of samples, the average converges to the true cross-correlation. 

Let {xt^yt) represent two time series, then the cross correlation is given by 
Eqn. 

C OTTxyir^ (^) 

where fix and {ly are the means of Xt and yt respectively. 
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Lagged Correlation Lagged correlation refers to the correlation between two 
time series shifted in time relative to one another. Lagged correlation is impor¬ 
tant for studying the couplings between time series for two reasons. First, one 
series may have a delayed response to the other series, or perhaps a delayed 
response to a common stimulus that affects both series. For example, some peo¬ 
ple may first access social media to solve their problems because of the quick 
response of social media. If they cannot find the answer, they may choose to call 
the authorities regarding the problem. Second, the response of one series to the 
other series or an outside stimulus may be smeared in time, such that a stimulus 
restricted to one observation elicits a response at multiple observations. 

The simple correlation coefficient between two time series properly aligned in 
time is inadequate to analyse the couplings. A very useful function for checking 
lagged correlations is the sample cross correlation function (CCF). The CCF 
function is the correlation between the series shifted against one another as 
a function of number of observations of the offset. If the individual series are 
autocorrelated, the estimated cross-correlation function may be distorted and 
misleading as a measure of the lagged relationship. 

In the relationship between two time series {yt and Xt), the series yt may be 
related to past lags of the x-series. The CCF function is helpful for identifying 
lags of the x-variable that might be useful predictors of yt. In this paper, we use 
this CCF function defined as the set of sample correlations between Xt+h and 
yt for h = 0, ±1, ±2, ±3... to determine the time lags between call centre and 
twitter. A negative value for h is a correlation between the x-variable at a time 
before t and the ^-variable at time t. For instance, consider h = —2, the CCF 
value would give the correlation between Xt -2 and yt. Simply, the following two 
rules are applied to identify whether one time series leads or lags the other time 
series. 

— When one or more Xt-\-h^ with h negative, are predictors of yt, it is said that 
X leads y. 

— When one or more with h positive, are predictors of yt, it is called that 
X lags y. 

According to the above rules, we can set different positive value for h to ex¬ 
amine the delayed time between time series x and time series y. After calculating 
the cross-correlation between the two time series, the maximum (or minimum if 
the two series are negatively correlated) of the cross-correlation function indi¬ 
cates the point in time where the time series are best aligned, i.e. the time delay 
between the two series is determined by the argument of the maximum of the 
cross-correlation as in Eqn. 


Tdeiay = arg max {Carr {xt+u, Vt)) (4) 

h 

Time Series Symbolic Representation Through the time series decompo¬ 
sition method, we can partly remove the seasonal patterns. However, if we look 
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at the decomposed time series for call centre data as shown in Fig. it can 
be clearly seen that there is still some big decrease during weekends and public 
holidays, probably because the seasonal patterns are not completely removed. 
Therefore, we apply a symbolic aggregate approximation (SAX) [10] method to 
transform the real valued time series into symbolic representations. SAX is the 
first symbolic representation for time series that allows for dimensionality reduc¬ 
tion and indexing with a lower-bounding distance measure. Symbolic Aggregate 
approximation was proposed by Lin et al. and extends piecewise aggregate ap¬ 
proximation (PAA) based approach |8| inheriting algorithm simplicity and low 
computational complexity while providing satisfiable sensitivity and selectivity 
in range-query processing. 

SAX transforms a time-series x of length n into the string of arbitrary length 
cj, where uj <n typically, using an alphabet A of size a > 2. The SAX algorithm 
consist of two steps: during the first step it transforms the original time-series 
into a PAA representation and this intermediate representation gets converted 
into a string during the second step. Use of PAA at the first step brings the 
advantage of a simple and efficient dimensionality reduction while providing the 
important lower bounding property. The second step, actual conversion of PAA 
coefficients into symbols, is also computationally efficient. 

To better analyse the correlations between call centre and public media, 
we follow this direction and first transform the original time series into a PAA 
representation and then apply the symbolic representation of the changing trends 
to help understand the correlations. 



User 

Interface 





R Shiny 





V 


Correlation 

Analysis 


Fig. 6. Architecture of the Interface for Gorrelation Analysis 


Correlation Analysis Interface We implemented a friendly interface with 
the architecture in Fig. for correlation analysis between call centre and so¬ 
cial media. Users can simply select one category to see the correlations during 
different time periods. Then the cross-correlations and time lagged correlations 
by different strategies can be clearly seen from the interface. For Twitter and 
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call centre, the interface includes eight categories: study, visit, work, permanent, 
citizen, refugee, skilled permanent, and working holidays. 


4 Experimental Analysis 


The experimental steps are as follow: we first extract external twitter data set, 
and integrate call centre data and twitter data together, then apply time series 


transformation introduced in Section to transform the data sets into topical 
time series. After that, we use different correlation analysis methods to study 
the couplings between call centre and twitter. 


4.1 Data Set 

Social Media Data We use an open R package “twitteR” to extract all the 
tweets posed by “DIBPAustralia” and by DIBP’s important followers and friends 
who have more than 500 followers. In addition, we also manually choose 32 
twitter users such as “©lAUSVisa”, Australiadu”, Work-Visa” who are 
doing visa or migration business to help people get Australian visas. The tweets 
posted by these 32 users are extracted as well, because they are closely related to 
“visa” or “migration” topic. In total, this extracted twitter data has 1.45 million 
tweets ranging from 24 March 2007 to 05 March 2014. 



Fig. 7. Frequency Strategy for Visit Category 


Call Centre Data In this paper, we combine DIBP’s three different call centre 
data respectively from Canberra, Ottawa, and London. The details of the three 
call centre data are as follow: 

— Canberra Call Centre Data includes 352,351 enquiry records ranging from 
02-Dec-2013 to Ol-March-2014. After a phone call is finished, DIBP’s call 
centre agents manually categorize this enquiry into different topics shown in 
“DISPOSITION” fields. 
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call centre call centre 



Fig. 8. Percentage Strategy for Visit Gategory 


— Ottawa Call Centre Data contains 103,069 records (emails and phone calls) 
ranging from 2-July-2013 to 12-March-2014. Field “Category” shows the 
topics of the enquires. 

— London Call Centre Data includes 175,450 records ranging from Ol-Jan-2013 
to 28-Feb-2014. 


Since the three call centre data are from different countries and have different 
labels to identify topic information, we integrate these call centre data together 
by the merging rules introduced in Section |3.1[ After this integration, we get 
the completely overlapped combined call centre data including 387,037 records 
ranging from Ol-Dec-2013 to 28-Feb-2014. 


4.2 Typical Discovered Correlations 

In visit category as shown in fig. and [^during 01 Dec, 2013 to 21 Dec, 2013, no 
delay with the probability 0.86 for frequency strategy, and twitter lags call centre 
one day with the correlation value -0.44 for percentage strategy. The results of 
the symbolic representation also show that frequency-based trends positively 
correlated and percentage-based trends negatively correlated. 

In study category as shown in fig. i andp^ during 01 Dec, 2013 to 14 Dec, 
2013, we observe that call centre delays three days with the correlation -0.58 for 
frequency strategy, but no delay using percentage strategy with the similarity 
0.29. From the symbolic representation, the results show that the two changing 
trends are negatively correlated during this period. 

In permanent category as shown in fig. pT] and \T^ during 02 Feb, 2014 to 15 
Feb, 2014, call centre lags one day for frequency strategy with the probability 
0.62, while lags five days using percentage strategy with the similarity -0.46. The 
weekly aggregated results of symbolic representation have the same changing 
trends. 

In work category as shown in fig, and during 12 Jan to 25 Jan, there is 
no delay for call centre and twitter with probability 0.74 for percentage strategy. 
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Fig. 9. Frequency Strategy for Study Category 


call centre call centre 



Fig. 10. Percentage Strategy for Study Category 


call centre call centre 



Fig. 11. Frequency Strategy for Permanent Category 


call centre call centre 



Fig. 12. Percentage Strategy for Permanent Category 
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However, call centre delays three days with -0.5 for frequency strategy. The sym¬ 
bolic results also demonstrate the same changing trends that frequency-based 
trends are negatively correlated, but percentage-based trends are positively cor¬ 
related. 



Tim* Tim* 



I. 

I 




Fig. 13. Frequency Strategy for Work Gategory 


call cenire call centre 



Tim* Tim* 


Fig. 14. Percentage Strategy for Work Gategory 


In citizen category as shown in fig. \TE\ and pTj during 02 Feb to 15 Feb, no 
delay with the correlation value 0.65 for frequency strategy, but call centre delays 
two days with -0.61 for percentage strategy. Differently, the symbolic representa¬ 
tion results indicate that frequency-based trends are negatively correlated, and 
percentage-based trends are positively correlated. 

4.3 Discussions 

From the above discovered couplings between call centre and twitter, we can 
clearly see that the two data sources indeed have some coupling relations and 
time lagged correlations. The time lagged correlation can be concluded as three 
situations: 1) call centre lags twitter is the most common case, which means 
that social media normally has a quicker response regarding different topics. 
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call centre call centre 



Fig. 15. Frequency Strategy for Citizen Category 


call centre call centre 



Fig. 16. Percentage Strategy for Citizen Category 


and people will turn to call centre to verify or confirm the discussed information 
on twitter. 2) call centre and twitter has no delay means that some events have 
big infiuence so that the discussions regarding the topic on twitter and call centre 
both increase quickly. 3) twitter lags call centre is not very common but indeed 
happen in our discovered couplings, which represent that some people prefer to 
directly communicate with DIBP to get the answers, then to share or discuss 
further information with their friends on social media. 

It is still difficult to determine whether the correlations are influenced by 
some relevant events or policies related to DIBP, because the significant events 
themselves are not easy to be confirmed. But the positive response from business 
staff in DIBP can partly demonstrate that this coupling learning between call 
centre and twitter is very interesting and meaning. 

5 Conclusion and Future Work 

In this paper, we studied the coupling analysis between DIBP’s call centre and 
external twitter data, and proposed a coupling analysis framework. First, we 
transformed the extracted twitter and call centre data to time series, then ap¬ 
plied time series decomposition to remove seasonal patterns. After that, time 
series correlation analysis with frequency and percentage strategies was used 
to discover the couplings. Besides this, to better understand the couplings, we 
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also applied the symbolic representation method to show the couplings. The ex¬ 
perimental analysis parts demonstrated that the two data sources indeed have 
coupling relations, which are significant for business. 

The call centre data currently only covers three months from December 2013 
to February 2014, which is not enough to get a convincible analysis results. To 
continue more effective analysis, it is necessary to collect more call centre data 
for overcoming the limitations of data sets. 

Regarding the correlation analysis part, currently we just analysed the overall 
changing trends and their correlations on call centre and social media. We actu¬ 
ally found that correlations indeed exist between the two data sources. However, 
the events or significant policies which incur the correlations and the changing 
trends are still not clear, because just very rare policies have been published 
on social media websites, which cause tough to find the related policies. More 
significant policies could be beneficial for deeper understanding of correlations. 
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